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ABSTRACT 

Long non-coding RNAs (IncRNAs) and microRNAs 
(miRNAs) represent two classes of important 
non-coding RNAs in eukaryotes. Altliougli these 
non-coding RNAs have been implicated in organis- 
mal development and in various human diseases, 
surprisingly little is known about their transcrip- 
tional regulation. Recent advances in chromatin 
immunoprecipitation with next-generation DNA 
sequencing (ChlP-Seq) have provided methods of 
detecting transcription factor binding sites (TFBSs) 
with unprecedented sensitivity. In this study, we 
describe ChlPBase (http://deepbase.sysu.edu.cn/ 
chipbase/), a novel database that we have de- 
veloped to facilitate the comprehensive annotation 
and discovery of transcription factor binding maps 
and transcriptional regulatory relationships of 
IncRNAs and miRNAs from ChlP-Seq data. The 
current release of ChlPBase includes high- 
throughput sequencing data that were generated 
by 543 ChlP-Seq experiments in diverse tissues 
and cell lines from six organisms. By analysing 
millions of TFBSs, we identified tens of thousands 
of TF-lncRNA and TF-miRNA regulatory relation- 
ships. Furthermore, two web-based servers were 
developed to annotate and discover transcriptional 
regulatory relationships of IncRNAs and miRNAs 
from ChlP-Seq data. In addition, we developed two 
genome browsers, deepView and genomeView, to 
provide integrated views of multidimensional data. 
Moreover, our web implementation supports diverse 
query types and the exploration of TFs, IncRNAs, 
miRNAs, gene ontologies and pathways. 



INTRODUCTION 

It has become increasingly clear that eukaryotic genomes 
encode thousands of long non-coding RNAs (IncRNAs) 
and microRNAs (miRNAs) (1-4). Emerging evidence is 
revealing that IncRNAs and miRNAs serve as the nodes 
of signaling networks that regulate cancer, apoptosis, pro- 
hferation, differentiation and stem cell biology (1,2,5-8). 
However, the majority of studies that address these types 
of RNAs focus on defining the regulatory functions of 
IncRNAs and miRNAs, whereas few investigations are 
directed toward assessing how the IncRNA and miRNA 
genes themselves are transcriptionally regulated. 

The major limitation in identifying the transcriptional 
regulatory relationships of IncRNAs and miRNAs has 
been the high false-positive rates of predictive algorithms 
for transcription factor binding sites (TFBSs) (9). 
Recently, chromatin immunoprecipitation with massively 
parallel DNA sequencing (ChlP-Seq) has provided a 
powerful way to identify TFBSs. The application of the 
ChlP-Seq technique has significantly reduced the rate of 
false-positive predictions of TFBSs (10-12). However, 
although ChlP-Seq technology can reliably identify 
TFBSs, few studies have used ChlP-Seq data to explore 
the transcriptional regulation of IncRNAs and miRNAs. 
For this reason, a high-quahty integrated database that 
could facihtate the annotation and analysis of the tran- 
scriptional regulation of IncRNAs and miRNAs from 
ChlP-Seq data will be of great utility in the study of 
both the regulation of IncRNAs and miRNAs by TFs 
and the roles of this regulation in human diseases. 

In this study, we developed the ChlPBase to facilitate 
the integrative and interactive display, as well as the 
comprehensive annotation and discovery, of TF-lncRNA 
and TF-miRNA interaction maps from ChlP-Seq 
data that were generated from diverse tissues and ceU 
fines from six organisms: human, mouse, dog, chicken, 
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Drosophila melanogaster and Caenorhabditis elegans 
(Figure 1). ChlPBase contains tens of thousands of 
TF-lncRNA and TF-miRNA regulatory relationships, as 
well as miUions of TFBSs (Table 1). In addition, two novel 
web servers and two genome browsers were developed to 
comprehensively explore the relationships of TFs and 
ncRNAs from ChlP-Seq data. 

MATERIALS AND METHODS 

A total of 543 ChlP-Seq peak data sets for 252 different 
transcription factors were compiled from multiple related 
studies and downloaded from the NCBl GEO database 
(13), the ENCODE (14) and modENCODE (15,16) data- 
bases, or the Supplementary Data of the original research 
articles (Supplementary Table SI). We have also manually 
curated metadata (such as TF name, refSeq accession 
number, gene symbols and detailed descriptions and ex- 
pression patterns of TFs) to ensure annotation consist- 
ency. These peak data sets were converted to latest 
genome version using HftOver tool from the UCSC 
genome browser website (17), and peaks whose genomic 
regions could not be transformed into latest version of the 
genome were discarded. In addition, some data sets in 
BedGraph format were downloaded to construct peak 
tracks and displayed in our deepView browser to allow 
users to check TFBSs. In each species, TFBSs from dif- 
ferent transcription factors and many different cell lines 



were combined and sorted according to their genomic pos- 
itions. And then, the overlapping TFBSs were grouped 
into clusters and were imported into database; each 
cluster included at least one TFBS. Known transcription 
factor binding matrices were downloaded from the 
JASPAR (18), Transfac (19), Cistrome (20) and 
UniPROBE (21) databases. 

All of the known IncRNAs or large intergenic 
non-coding RNAs were downloaded from the 
Supplementary Data of the six original research articles 
that addressed these RNAs (22-27) or extracted from 
Ensembl (28), refSeq (17) and UCSC Bioinformatics 
website (17). Known functional IncRNAs were down- 
loaded from IncRNAdb database (29). All of the known 
miRNAs were downloaded from miRBase [release 17.0, 
(30)]. miRNA targets were downloaded from starBase 
database (31). All of the refSeq genes were downloaded 
from the UCSC bioinformatics websites (17). Other 
known non-coding RNAs were downloaded from the 
Ensembl database (28) or the UCSC websites (17) or 
were obtained from the relevant literature. The human 
(UCSC hgl9), mouse (UCSC mm9, NCBl Build 37), 
dog (UCSC canFam2), chicken (UCSC galGaO), D. 
melanogaster (UCSC dm3) and C. elegans (WS190) 
genome sequences were downloaded from the UCSC bio- 
informatics websites (17). 

Pre-miRNAs were grouped into transcriptional units. 
TFs might not almost exclusively bind at proximal 
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Figure 1. A system-level overview of the core framework of ChlPBase. All of the results that are generated by ChlPBase are deposited in MySQL 
relational databases and displayed in the visual browser and web page. 
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Table 1. The data that are incorporated into ChlPBase 



Species 


Experiments 


TFBSs 


TFBS cluster 


TF-miRNA 


TF-lncRNA 


Human 


329 


6134098 


1 525 778 


41 809 


640 741 


Mouse 


119 


2 042 652 


478 331 


7306 


109 383 


Dog 


2 


65 674 


52 368 


149 


/ 


Chicken 


3 


40 635 


37 108 


391 


/ 


D. melanogaster 


53 


160202 


29 565 


1788 


98 710 


C. elegans 


37 


216992 


42 666 


1790 


/ 



These statistics indicate the numbers of sequencing experiments (ChlP-Seq), TFBSs, TFBS clusters (transcription factor binding clusters), 
TF-lncRNA interactions and TF-miRNA interactions that are incorporated into ChlPBase. These data are from six organisms, namely, human, 
mouse, dog, chicken, D. melanogaster and C. elegans. Known IncRNAs for dog, chicken and C. elegans are not available, and therefore the 
TF-lncRNA interactions for these species are denoted as '/' in the table. 



promoters of IncRNAs or miRNAs. For protein-coding 
genes, more than half of the observed binding events are 
distal events (32). Moreover, the distances between 
transcription initiation sites (TSSs) and miRNA genes 
dramatically vary, ranging from a hundred bases to 
thousands of bases upstream (33,34). To incorporate 
proximal and distal binding events (32), for intergenic 
miRNAs, the 30-kb region upstream and the 10-kb 
region downstream of the TSS of the first pre-miRNA in 
the same transcriptional unit were chosen as the regula- 
tory domain (32) of the examined miRNAs. For intronic 
miRNAs, the 30-kb region upstream and the 10-kb region 
downstream of TSS of the host genes contained miRNAs 
were chose as the regulatory domain of the examined 
miRNAs. The same strategy was used for IncRNAs; we 
chose a 30-kb region upstream and a 10-kb region down- 
stream of the TSS of each IncRNA as the regulatory 
domain (32) of each IncRNA. Five-kb upstream region 
and 1-kb downstream region of each IncRNA and 
miRNA were chosen as promoter region (32). In each 
species, regulatory domains/regions of each IncRNA/ 
miRNA were intersected with TFBSs of each data set to 
identify TFs that regulated the examined ncRNAs. And 
then, TFBSs overlapping with regulatory domains and 
corresponding IncRNAs were imported into MySql 
database. 

DATABASE CONTENT 

The genome-wide binding maps and high-occupancy 
target regions of transcription factors 

We integrated ~8.7 milhon TFBSs from 543 ChlP-Seq 
experiments in various tissues or ceh hues to provide com- 
prehensive genome-wide transcription factor binding 
profiles. To provide more useful information, we 
generated extensive annotations and analyses for tran- 
scription factors and TFBSs. For each TFBS, we 
identified the nearest/target gene and the distance 
between the site and the gene, as well as the expression 
pattern of the TF and its target gene in various tissues or 
cell lines (Supplementary Figure SI). For each ChlP-Seq 
experiment, we identified the distribution of TFBSs in the 
body of the gene and the distribution of the distances of 
the TFBSs that are associated with the TSSs of the nearest 
genes, and we provided descriptions of the ChlP-Seq 



experiments and the expression patterns of the TFs 
(Supplementary Figure S2). In addition, we offered the 
chipGO and chipKEGG tools to explore the features of 
the hsts of TF-target interactions that are derived from the 
ChlP-Seq data (Figure 1 and Supplementary Figure S3). 

To directly investigate potential high-occupancy target 
(HOT) regions on a genome-wide scale, we grouped ~8.7 
million TFBSs into ~2 million clusters (Table 1). Each 
cluster contains between 1 and 74 transcription factors. 
We designated the genomic locations that were bound 
by many TFs as HOT regions. For instance, we identified 
26664 HOT regions that were bound by >15 factors in the 
human genome. In addition, we generated distribution 
maps of the numbers of transcription factors in the 
clusters. The maps are presented in the form of cluster 
peaks, which are displayed in our deepView genome 
browser (Figure 1 and Supplementary Figure S4). This 
display method allows us easily to determine HOT 
regions of TFs. We also identified the nearest/target 
genes of these clusters and created a web interface to 
display this information (Figure 1 and Supplementary 
Figure S4). 

The annotation and identification of TF-IncRNA and 
TF-miRNA regulatory relationships 

To investigate TF-lncRNA and TF-miRNA regulatory 
relationships, the regulatory domains (see the Materials 
and Methods section for a detailed description) of 
IncRNAs and miRNAs were intersected with all TFBSs 
from diverse tissues and cell lines. In total, we identified 
~848 834 TF-lncRNA regulatory relationships between 
221 TFs and 38 293 IncRNA transcripts, as weU as 
53233 TF-miRNA regulatory relationships between 249 
TFs and 2294 miRNA clusters (Table 1). Because of its 
integration of the large number of high-resolution 
ChlP-Seq data from diverse tissues and cell lines, this 
analysis provides an enhanced resolution of these regula- 
tory relationships. Moreover, to enable us to explore the 
interplay between miRNA transcriptional and 
posttranscriptional regulation, we integrated the targets 
of miRNAs from our starBase database (31) into the 
TF-miRNA networks. Cytoscape (web version) (35) 
were used to display and draw the TF-miRNA and 
miRNA-target networks. 
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WEB INTERFACE 

The use of the deep View and genome View genome 
browsers for the comparative analysis of TFBSs and the 
TF-lncRNA or TF-miRNA regulatory relationships 

The large quantity of TFBSs and high-throughput 
ChlP-Seq data has increased the demand for visual tools 
that allow for the rapid visual correlation of different 
types of information. To enable the user to browse seam- 
lessly along the genome and to zoom effortlessly in a very 
large set of ChlP-Seq data, our improved deepView 
genome browser (31,36) was developed. This browser 
provides an integrated view of TFBSs, IncRNAs, 
miRNAs, protein-coding genes, TF cluster peaks and 
TF clusters (Figure 2). In the deepView genome 
browser, the 'zoom out' or 'zoom in' button can be used 
to extend or shrink the width of the displayed coordinate 
range. A click on a track item (e.g. a miRNA, IncRNA or 
TFBS) of interest launches a multiple-alignment trace 
viewer that displays aU of the traces that are relevant to 
the item in question or hnks to external resources, such as 
NCBT UCSC and miRBase, that can be used to obtain 
more comprehensive information. 

To provide the whole-genome-scale visualization of 
large-scale TFBSs, miRNAs and IncRNAs, a new 



genome browser, genomeView, was developed in this 
study (Figure 3). The user of this browser can view data 
for a single ChlP-Seq experiment across the entire genome 
in the context of miRNAs or IncRNAs. TFBSs and 
miRNAs or IncRNAs are displayed for each location in 
the genome as a profile over the chromosome ideogram. 
This feature allows the user to quickly observe 
genome-scale patterns in the regulatory data and identify 
regions of interest for further visualization in our 
deepView genome browser. 

The web-based exploration of TF-lncRNA and 
TF-miRNA regulatory relationships 

We provide two web interfaces, LncRNA and 
MicroRNA, which may be used to display the 
TF-lncRNA and TF-miRNA interaction relationships, re- 
spectively (Supplementary Figures S5 and S6). Users can 
browse the relationships by entering a IncRNA name. 
When a user starts typing a IncRNA name in the search 
box, suggested IncRNA names are displayed in the list 
box. The user can then either choose a IncRNA from 
the Hst box or finish typing a fuU gene name. The user 
can also select a TF and search for IncRNAs that are 
regulated by the selected TF. If users do not enter 
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Figure 2. Illustrative screen shots from the deepView browser. The deepView browser provides TFBSs that have been identified from ChlP-Seq data, 
predicted TFBSs, IncRNAs, miRNAs, protein-coding genes and TFBS clusters. 
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Figure 3. Snapshot of the genomeView browser. The genomeView browser provides the whole-genome-scale visualization of large-scale TFBSs, 
miRNAs and IncRNAs. 



liicRNA name and TF name, webpage will output all the 
TF-lncRNA interactions. Users can download these inter- 
actions to construct more complex networks composed by 
dozens to hundreds of IncRNAs. The results of the search 
are hsted as the TF-lncRNA table. For the IncRNA inter- 
face, the numbers of TFBSs for each IncRNA in are 
indicated in a table. The users can click on a number 
within the table to launch a detailed page that provides 
further information about the TF-lncRNA interaction in 
question. The user can also click on the title of the table to 
sort TF-lncRNA interactions according to various 
features, such as the number of TFBSs, the IncRNA 
names or the TF names. The detailed information for a 
TF-lncRNA interaction includes a description of the TF 
gene and its distance to the start site of the IncRNA 
(Supplementary Figure S5). The 'references' section 
enables the retrieval of the primary articles yielding the 
annotation data. Click the article title hnk to visit the 
NCBI PUBMED website. 



The niicroRNA interface is organized similarly to the 
LncRNA interface. The user can select a miRNA and a 
TF gene from a drop-down menu to explore TF-miRNA 
interactions. The numbers of upstream and downstream 
TFBSs, the genomic coordinates and the distance to 
the start site of the miRNA are all presented in a table 
(Supplementary Figure S6A and B). In addition, we have 
constructed a webpage. Networks, to simply display 
TF-miRNA, miRNA-target and TF-target interactions 
(37) using cytoscape (web version) (35) by integrating 
our starBase database. Users can select different miRNA 
target regulated by examined miRNA to construct regu- 
latory networks (Supplementary Figure S6C). For 
example, we recapitulated the pubhshed c-Myc, E2F1 
and miR-20 network (38) by selecting hsa-miR-20a and 
miRNA-target gene E2F1 in Networks webpage 
(Supplementary Figure S6D). 

The interface of the transcriptional regulatory for other 
ncRNAs (such as snoRNAs, tRNAs, snRNAs, etc.) is also 
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provided and organized similarly to the IncRNA and 
miRNA interfaces. Users can explore their regulatory 
interactions by similar ways. 

The web-based annotation of transcription factor 
binding regions 

We also provide the annotatedTool program, which offers 
a simple and user-friendly interface to annotate transcrip- 
tion factor binding regions (TFBRs). The user is required 
to select an intended organism and annotated TSSs of 
known protein-coding genes, IncRNAs or miRNAs and 
then upload TFBRs in the browser extensible (BED) 
format. After the user has completed the data submission, 
a typical iteration of the annotatedTool program may 
require several minutes to finish. The output of this 
program consists of three parts: the distribution of dis- 
tances between the center of the TFBR and the TSS, in- 
formation about the nearest gene and a hnk to the 
deepView genome browser, which allows the user to 
view various features of each target region 
(Supplementary Figure S7). 

EXAMPLE APPLICATIONS 

In the following section, we will present several example 
apphcations of ChlPBase. 

hsa-miR-122, a target of liver-enriched 
transcription factors 

Let us assume that we are interested in liver-specific TFs 
as transcriptional regulators of miR-122, which is also ex- 
pressed in the hver. We select three liver-enriched TFs 
(HNF4A, CEBPA and HNF3B/FoxA2) and the 
miR-122 gene in the micro RNA webpage. The results 
page summarizes all of the query results: (i) there are 
five ChlP-Seq experiments for these three TFs 
(Supplementary Figure S8A and B); (ii) there are 
HNF4A ChlP-Seq data from three different experiments; 
and (iii) HNF4A and HNF3B/FoxA2 have multiple 
binding sites in the regulatory domain of miR-122 
(Supplementary Figure S8A and B). We navigate to the 
corresponding deepView genome browser by locating the 
miR-122 regulatory domain, which opens up a genome 
browser view that effectively recapitulates the published 
TFBSs (39) (Figure 4). 

IncRNAs as targets of embryonic stem cell 
transcription factors 

To relate a hicRNA gene to the core transcriptional 
circuitry of embryonic stem (ES) ceUs, we select nine 
pluripotency-associated transcription factors (including 
Oct4, Sox2, Nanog, c-Myc, n-Myc, Klf4, Zfx, E2F1 and 
Smadl) in the mouse genome at the IncRNA webpage. 
The results page summarizes the pluripotency-associated 
transcription factors that bind in the regulatory domains 
of IncRNAs. A click on hncl428, a known ES-ceUs- 
associated IncRNA, launches a deepView genome 
browser view that also recapitulates the pubhshed 
TFBSs of E2F1, n-Myc and Klf4 (25) (Figure 5). 



DISCUSSIONS AND CONCLUSIONS 

Ultra-high-throughput next-generation sequencing tech- 
nology has recently been developed for mapping TFBSs 
(10,11). In this study, we performed a large-scale integra- 
tion of public TFBSs that have been generated by high- 
throughput ChlP-Seq technology and provide the most 
comprehensive TF data set for various cell types that are 
available at the present time. We also provide comprehen- 
sive transcriptional regulatory maps of IncRNAs and 
miRNAs by connecting TFs to these non-coding genes. 

The transcriptional regulation of the majority of 
miRNAs and almost aU of the discovered IncRNAs is cur- 
rently unknown. Recent studies have revealed that the de- 
regulation of miRNAs and IncRNAs is correlated with 
various human cancers and diseases (6,7), and that this 
deregulation is often due to the aberrant expression of 
TFs (39). In the current study, we developed the 
ChlPBase database to decode the transcriptional regula- 
tion of IncRNA and miRNA genes from ChlP-Seq data. 
We can use ChlPBase to recapitulate the known transcrip- 
tional regulatory relationships of miRNAs and IncRNAs. 
For example, ChlPBase can be used to identify that the 
liver-specific miR-122 is regulated by three liver-enriched 
TFs (39), and that the linc2048 IncRNA is regulated by 
embryonic stem cell (ESC)-associated transcription 
factors (25). In addition, the integration of a large 
quantity of ChlP-Seq data from diverse tissues and cell 
Hues allows us to provide enhanced resolution and novel 
findings. 

In comparison with other sources, for elucidating the 
transcriptional regulation of IncRNAs and miRNAs, or 
storing and analyzing ChlP-Seq data, the distinctive 
features in our ChlPBase database are as follows, (i) 
Our ChlPBase database is the first database that 
provides the transcriptional regulation maps for IncRNA 
genes, (ii) The other databases that are related to tran- 
scriptional regulation for miRNAs, including transmiR 
(40) and CircuitsDB (41), only collect computationally 
predicted or experimentally supported TF-miRNA inter- 
actions. By contrast, ChlPBase provides the comprehen- 
sive TF-miRNA regulatory relationships that have been 
identified from high-throughput ChlP-Seq data. The 
entries in TransmiR database contain only the name of 
TFs and their corresponding target miRNAs. The detail 
information of TFBSs and TFs, however, is not included. 
Also, the TransmiR database may not contain the com- 
prehensive target miRNAs of the corresponding TFs. We 
used two TFs, E2F1 and MYC, whose target miRNAs are 
the most comprehensive in TransmiR to perform compari- 
son between TransmiR database and our ChlPBase 
database. When considered only the relationships of TFs 
and target miRNAs, ChlPBase could identify 77% (20/26) 
E2Fl-miRNA and 75% (21/28) MYC-miRNA relation- 
ships. These data indicated that ChlPBase could recover 
majority of TF-miRNA interactions documented in 
TransmiR. Moreover, our database also identifies tens 
of novel E2Fl-miRNA and MYC-miRNA relationships 
that were not included in transmiR data. In addition, 
TransmiR does not contain HN4FA-miR-122 and 
CEBPA-miR-122 interactions described in our Example 
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Figure 4. Liver-enriched transcription factors regulate the expression of hsa-miR-122. The liver-enriched transcription factors (HNF4A, CEBPA and 
HNF3B/FoxA2) bind to the promoter regions of the primary transcript (XLOC_012693) of hsa-miR-122. 
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Applications sections, (iii) limChIP (42) is a database of 
genome-wide ChlP data in human (ligl8 version) and 
mouse (mm8 version). It just provides the Protein-DNA 
binding intensities from individual samples for 
user-provided genomic regions. Currently, hmChIP does 



not explore the transcriptional regulatory of IncRNAs 
or niiRNAs, even for protein-coding genes. By contrast, 
our ChlPBase provides comprehensive transcriptional 
regulatory relationships of IncRNAs, niiRNAs and 
other ncRNAs, as well as comprehensive annotation of 
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TFBSs from 543 ChlP-Seq data sets from six organisms, 
(iv) To enable the user to browse seamlessly along the 
genome and to zoom effortlessly in a very large set of 
ChlP-Seq data, our improved deep View genome browser 
was developed to provide an integrated view of TFBSs 
that have been identified from ChlP-Seq data, predicted 
TFBSs, ncRNAs, protein-coding genes and TFBS clusters 
(Figures 2 and 3). (v) We developed two web tools, 
annotatedTool and genomeViewer to annotate and 
discover the transcriptional regulatory relationships of 
IncRNAs and miRNAs from ChlP-Seq data (Figure 1). 
(vi) We constructed genome-wide transcription factor 
binding profiles from ChlP-Seq data. Combinatorial tran- 
scription factor interactions that control the transcrip- 
tional regulations of IncRNAs and miRNAs were easily 
identified by searching for appropriate profiles in the 
genome browser (Figure 2). (vii) ChlPBase also provides 
the gene ontology annotation, biological pathways and 
expression patterns of transcription factor binding targets 
(Figure 1). This supplementary information may provide 
valuable insights into the function of each TF, IncRNA 
and miRNA. Finally, the data and the integrative, inter- 
active and versatile displays that are provided by the 
ChlPBase database will aid future experimental and 
computational studies in their attempts to elucidate the 
regulation of IncRNAs and miRNAs by TFs and assess 
the roles of these regulatory relationships in human 
diseases. 



FUTURE DIRECTIONS 

As a means of comprehensively integrating 
ChlP-Seq data, ChlPBase is expected to provide consid- 
erable resources to assist researchers that are investigating 
the TF-lncRNA and TF-miRNA regulatory networks and 
examining the biological functions of the genes 
and ncRNAs with expression levels that are controlled 
by transcription factors. As ChlP-Seq technology is 
applied to a broader set of species, cell fines, tissues 
and conditions, ChlPBase will continue to be developed 
and refined toward the achievement of the following 
goals: (i) the better integration and cross-comparison of 
diverse ChlP-Seq data sets and data resources, (ii) the 
correlation of these diverse ChlP-Seq data 
with IncRNAs and miRNAs. (iii) we will continue to 
extend the amount of storage space and improve the 
performance of our computer servers for storing 
and analysing these new data, and improve the database 
to accept upload of new data by the users. In addition, 
we intend to integrate the epigenomic data that are 
generated by ChlP-Seq technology into ChlPBase to 
improve our understanding of eukaryotic regulatory 
networks. 
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ChlPBase is freely available at http://deepbase.sysu.edu. 
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